Features for Generic Corpus Querying

نویسندگان

  • Thomas Eckart
  • Christoph Kuras
  • Uwe Quasthoff
چکیده

The availability of large corpora for more and more languages enforces generic querying and standard interfaces. This development is especially relevant in the context of integrated research environments like CLARIN or DARIAH. The paper focuses on several applications and implementation details on the basis of a unified corpus format, a unique POS tag set, and prepared data for word similarities. All described data or applications are already or will be in the near future accessible via well-documented RESTful Web services. The target group are all kinds of interested persons with varying level of experience in programming or corpus query languages.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Investigation of the Generic Features of Research Articles Published in the Bulletin of Iranian Mathematical Society

In light of the understanding that the analysis of the generic features of different academic genres can enhance the ability of non-native members of academic discourse communities to understand, and where appropriate, to produce them, the present study aimed at investigating the dominant generic structure of research articles in mathematics. To start with a relatively narrow focus, a corpus of...

متن کامل

Examining the Generic Features of Thesis Acknowledgments: A Case of Iranian MA Graduate Students Majoring in Teaching to Speakers of Other Languages (AZFA) and TEFL

Thesis acknowledgement is a written genre in which MA graduate students offer their gratitude to individuals, who have contributed to the completion of their study. The aim of the current study was to examine the thesis acknowledgements written by Iranian MA students in the field of Persian Language Teaching to Non-Persian Speakers (Amouzeshe Zaban e Farsi be Kharejian, AZFA) and TEFL in terms ...

متن کامل

Research Article Introductions: Sub-disciplinary Variations in Applied Linguistics

The present study aimed to investigate the generic organization of research article introductions in local Iranian and international journals in English for Specific Purposes, English for General Purposes, and Discourse Analysis. Overall, 120 published articles were selected from the established journals representing the above subdisciplines. Each subdiscipline was represented by 20 local and 2...

متن کامل

A Cross-Disciplinary Genre Analysis of Rhetorical Features of Research Article Introductions Written by Iranians

The notion of genre has received a great deal of attention both in discourse analytic studies as well as in the field of ESP/EAP course design. The present paper has attempted to use genre analysis to account for the rhetorical features of research article introductions written by Iranian academics in two disciplinary fields of Education and Economics. The corpus comprised 40 research article i...

متن کامل

Exploring Sub-Disciplinary Variations and Generic Structure of Applied Linguistics Research Article Introductions Using CARS Model

This paper explores sub-disciplinary variations and generic structure of research article introductions (RAIs) within three sub-disciplines of applied linguistics (AL); namely, English for Specific Purposes (ESP), Psycholinguistics, and Sociolinguistics, using Swales’(1990) CARS model. The corpus consisted of 90 RAIs drawn from a wide range of refereed journals in the corresponding sub-discipli...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016